Corpus: ara_web_2011_30K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 95 96 99 99 99
1000 849 989 997 997 998
10000 6387 9310 9706 9817 9849
100000 15370 26085 28095 28773 28988
1000000 15370 26085 28095 28773 28988


Zipf's diagram for sentence endings


Gnuplot diagram

2115 msec needed at 2018-04-04 03:28